An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech
نویسندگان
چکیده
The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands-covering approximately 50-540, 540-1,700, 1,700-3,300, and above 3,300 Hz-from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated-the low &mid-high factor related to the two separate frequency ranges of 50-540 and 1,700-3,300 Hz, the mid-low factor the range of 540-1,700 Hz, and the high factor the range above 3,300 Hz-in these different languages/dialects, suggesting a language universal.
منابع مشابه
English phonology and an acoustic language universal
Acoustic analyses of eight different languages/dialects had revealed a language universal: Three spectral factors consistently appeared in analyses of power fluctuations of spoken sentences divided by critical-band filters into narrow frequency bands. Examining linguistic implications of these factors seems important to understand how speech sounds carry linguistic information. Here we show the...
متن کاملThe effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech
Ueda and Nakajima [Trans. Tech. Comm. Psychol. Physiol. Acoust., 38, 771-776, (2008); 39, 211-216, (2009)] found a consistent clustering of frequency bands common to different languages through factor analyses applied to power fluctuations of critical-band filtered speech sounds. One of the factors exhibited a characteristic shape of two peaks, which implied a correlation between a pair of dist...
متن کاملImproving Speech Intelligibility in Cochlear Implants using Acoustic Models
Cochlear implant (CI) is a prosthetic device that partially replaces the functions of the human ear via electrical stimulation. Cochlear implants are system and/or patient specific that mandates a simulation model prior to implantation. In the present work to improve the perceptual quality of the speech generated by a CI model, system specific parameters are analyzed by developing uniform bandw...
متن کاملMultidialectal Spanish acoustic modeling for speech recognition
During the last years, language resources for speech recognition have been collected for many languages and specifically, for global languages. One of the characteristics of global languages is their wide geographical dispersion, and consequently, their wide phonetic, lexical, and semantic dialectal variability. Even if the collected data is huge, it is difficult to represent dialectal variants...
متن کاملAn approach to multilingual acou devices
There is an increasing need to deploy speech recognition systems supporting multiple languages/dialects on portable devices worldwide. A common approach uses a collection of individual monolingual speech recognition systems as a solution. However, such an approach is not practical for handheld devices such as cell phones due to stringent restrictions on memory and computational resources. In th...
متن کامل